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THE SOLUTION TO THE KADISON-SINGER PROBLEM: YET ANOTHER PRESENTATION 


DAN TIMOTIN 


ABSTRACT. In the summer of 2013 Marcus, Spielman, and Srivastava gave a surprising and beautiful 
solution to the Kadison-Singer problem. The current presentation is slightly more didactical than other 
versions that have appeared since; it hopes to contribute to a thorough understanding of this amazing 
proof. 
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DAN TIMOTIN 


1. Introduction 

The Kadison-Singer Problem has been posed in [7] in the fifties, probably in relation to a statement 
of Dirac concerning the foundations of quantum mechanics. It has soon acquired a life of its own. 
On one hand, there have been several notable attempts to prove it. On the other hand, it has been 
shown that it is equivalent to various problems in Hilbert space theory, frame theory, geometry of 
Banach spaces, etc. However, for five decades the problem has remained unsolved. 

It is therefore very remarkable that in 2013 a proof has been given by Marcus, Spelman and Sri- 
vastava in [10]. The methods used were rather unexpected; moreover, they had shown their strength 
in some totally unrelated areas (Ramanujan graphs). They also have a very elementary flavour: most 
of the proof is based on a delicate analysis of the behavior of polynomials in one or several variables. 

In the year and a half that has passed a better grasp of the proof has been achieved, most no¬ 
tably through Terence Tao's entry in his blog IfTI) (but see also IfTTl ). It still remains an astonishing 
piece of research, obtaining spectacular results on a long standing conjecture through some not very 
complicated and apparently unrelated arguments. 

The purpose of these notes is to contribute towards a better understanding of the MSS proof. There 
is of course no pretense to any originality: the content is essentially in l llOl . with some supplementary 
simplification due to [14] (and occasionally to [15)). But we have tried to make it more easy to follow 
by separating clearly the different steps and emphasizing the main arguments; also, in various places 
we have gone into more details than in the other presentations. It is to be expected that the methods 
of [[10] might lead to new fruitful applications, and so it seemed worth to analyze them in detail. 

It is clear from the above that the notes concentrate on the MMS proof, so there will be very little 
about the Kadison-Singer problem itself and about the plethora of research that had evolved in the 
last fifty years on its relations to other domains. In particular, with one exception that we need to use 
(the paving conjecture), we will not discuss the different reformulations and equivalent statements 
that have been obtained. For all these matters, one may consult former beautiful presentations, as 
for instance |4). 

We will give in the next section a brief presentation of the original problem, as well as of another 
assertion, the paving conjecture, which has been shown soon afterwards to imply it. The description 
of the remaining part of the paper is postponed to Section [3j where the reader will have a general 
overview of the development of the proof. 

These notes have been written for a series of lectures given in December 2014 at the Indian Statis¬ 
tical Institute in Bangalore, in the framework of the meeting Recent Advances in Operator Theory and 
Operator Algebras. We thank B.V.R. Bath, J. Sarkar, and V.S. Sunder for the excellent work done in 
organizing the workshop and the conference, as well as for the invitation to present the lectures. 


2. The Kadison-Singer problem 

2.1. Pure states. The material in this subsection is contained in standard books on C* -algebras (see, 
for instance, (6)). 
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We denote by B (PL) the algebra of all bounded linear operators on the Hilbert space PL. A C*- 
algebra A C B (PL) is a norm closed subalgebra of B ( PL ), closed to the operation of taking the adjoint, 
and containing the identity. 

A state on a C*-algebra A is a linear continuous map p : A —> C, which is positive (meaning that 
p(a*a) > 0 for any a £ A), and such that p(I) — 1. One proves then that ||<|>|| — 1 and that tp satisfies 
the Cauchy-Schwarz inequality 

(2.1) \p(b*a)\ 2 < p(a*a)p(b*b) 

for all a,b £ A. 

The set S(H) of all states on A is a convex, w* -compact subset of A*. A state tp is called pure if it 
is an extreme point of &(A). 

Example 2.1. If A is commutative, then by Gelfand's Theorem it is isomorphic to C(X), the algebra 
of continuous functions on the compact space X of all characters (multiplicative homomorphisms) 
X : A —> C. The dual C(X)* is formed by all Borel measures on X, and ©(C(X)) is the set of 
probability measures on X. Pure states are precisely Dirac measures. In particular (and this is a fact 
that we will use below) a pure state on a commutative C* -algebra is multiplicative. 

Example 2.2. If A — B(PL), £ £ PL, and ||f|| = 1, then one can prove that (pg(T ) (T£,f) is a pure 

state. This fact will not be used in the sequel. 

By a theorem of Krein (see, for instance, [12. Ch.1.10]) any state cp on a C*-algebra A extends to a 
state p on B(PL). The set K^ of all extensions of tp is a convex ra*-compact subset of B(PL)*. 

Lemma 2.3. If tp is a pure state on A C B(PL), then the extreme points ofKq, are pure states ofB(PL). 

Proof. Suppose p is an extreme point of K^. If p — j(p\ + pi), with p\,p 2 states on B(PL), then 
p — j(pfA + pzlA). Since p is pure, we must have pfA = pz\A = p, so p\, p 2 £ K and therefore 

Pi — Pi — P- □ 

Consequently, a pure state p on A has a unique extension to a state on BiH) if and only if it has a 
unique pure extension to a state on B(PL). 

2.2. The Kadison-Singer conjecture. From now on we will suppose that the Hilbert space PL is 
£ 2 — P 2 (IN) and we will consider matrix representations of operators on B (P 2 ) with respect to the 
usual canonical basis of P 2 . We define T> to be the C*-algebra of operators on P 2 whose matrix is 
diagonal. Note that the map diag : B(P 2 ) —> 'D which sends an operator T to the diagonal operator 
having the same diagonal entries is continuous, positive, of norm 1. 

We may now state the Kadison-Singer Problem: 

Does any pure state on T> extend uniquely to a state onB(P 2 )? 

Although Kadison and Singer originally thought a negative answer to this question as more prob¬ 
able, in view of its eventual positive answer we will state the conjecture in the affirmative form. 

Kadison-Singer Conjecture (KS). Any pure state on V extends uniquely to a state on B(P 2 ). 
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The first thing to note is that any state p E &('D) has a "canonical" extension to 6(B(£ 2 )), given 
by 

(2.2) p(T) = ^(diag(T)). 

So the problem becomes whether p is or not the unique extension of (p to Bjl 2 ). If ip is another 
extension of p and T E B(£ 2 ), then 

p(T — diagT) = p(T) — <j>(diag T) = p(T) — <p(T). 

50 ip = p if and only if p(T — diag T) — 0 for any T E B(£ 2 ), which is equivalent to say that tp(T) = 0 
for any T E B (£ 2 ) with diag T — 0. As a consequence, we have the following simple lemma: 

Lemma 2.4. (KS) is true if and only if any extension ip E &{B{£ 2 )) of a pure state on A satisfies 

diagT = 0 => p{T) — 0. 

In fact, pure states of T> can be described more precisely Indeed, being a commutative algebra, 'D 
is isomorphic to C(X) (as noted in Example l2.lt . One can identify X precisely: it is /3JM, the Stone- 
Cech compactification of N. We do not need this fact, but will use only a simple observation. 

Lemma 2.5. If p is a pure state on V and P EV is a projection, then p(P) is either 0 or 1. 

Proof. It has been noted above (see Example I2.lt that p is multiplicative. Then p(P) — pi P 2 ) — 
p(P) 2 , whence p(P) is either 0 or 1. □ 

Remark 2.6. As hinted in the introduction, although in the original paper (7j there is no mention of 
quantum mechanics, in subsequent papers the authors state as source for the problem the work of 
Dirac on the foundation of quantum mechanics J5). For some comments on this, see Subsection |9]l 
below. 

2.3. The Paving Conjecture. Instead of dealing directly with the Kadison-Singer conjecture, we in¬ 
tend to prove a statement about finite dimensional matrices, which is usually known as Anderson's 
paving conjecture JT|. We use the notation V m to indicate diagonal m x m matrices and diag m the 
corresponding map from M m (C) to V m . 

Paving Conjecture (PC). For any e > 0 there exists r E N such that the following is true: 

For any m E N and T E B(C m ) with diag m T — 0, there exist projections Q\,... ,Q r E T> m , with 
ELi Qi = V and 

||Q,TQi|| <e||T|| 

for all i — 1,, r. 

A diagonal projection Q E T> m has its entries 1 or 0, so it is defined by a subset S C {1,..., m}. 
Thus diagonal projections Q\,... Q r E T> m with E;=l Qi ~ correspond to partitions {1,...,«} = 

51 U • • • U S r , S{ dSj — 0 for i j. 

It is important that in the statement of (PC) the number r does not depend on m. This allows us 
to deduce from (PC) a similar statement, in which C m is replaced with the whole £ 2 , is also true. We 
formulate this as a lemma. 
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Lemma 2.7. If (PC) is true, then for any e > 0 there exists r G N such that, for any T G B(£ 2 ) with 
diag T — 0 one can find projections Q\, ...,Q r eT>, with YJi=\ Qi — L and 

IlQiTQill <e||7| 


for alii — 1,..., r. 


Proof. Embed C m canonically into I 2 on the first m coordinates and denote by E m the corresponding 
orthogonal projection. For T G B(f 2 ) denote T m — E m TE m . Applying (PC), one finds diagonal 
projections Q^'^, ■ ■ ■, Qr"\ such that £- =1 and < e||T,„||. 

Now, diagonal projections in B(/' 2 ) can be identified with subsets of N, and therefore with ele¬ 
ments in the compact space {0,1} N . In this compact space any sequence has a convergent subse¬ 
quence; therefore a diagonal argument will produce an increasing subsequence of positive integers 
Mi/,, such that for each i — 1 .,r we have —> Qi for some Q,. We have 


E Qi = , lim E Qi mk) = , lim Im k = L 

k—fco k —^oo 

If £(, i] G d 2 are vectors with finite support, then <j, i] G C‘ ik for some k, and then 

\(QiTQ£,ri)\ = \(TQ&Qiti)\ = KT^qJ^£ qW j/)| - KQ^T^ 

< llQ^^Q^ljlflHIi/ll <ell 


□ 


The Paving Conjecture is actually equivalent to the Kadison-Singer Conjecture, but we will need 
(and prove) only one of the implications. 


Proposition 2.8. The Paving Conjecture implies the Kadison-Singer Conjecture. 

Proof. Fix e > 0, and suppose that r satisfies the conclusion of Lemma 12.71 Take a pure state ip G 
<5(B(l 2 )) and an operator T G B{£ 2 ) with diagT = 0. By Lemma I2l4l we have to show that ip{T) — 0. 

Let Qi be the diagonal projections associated to T by Lemma [2.71 By Lemma [2.51 f(Qi) — <p{Qi) 
is 0 or 1 for each i. Since 1 = (p{I) — Yf l= \ <p(Qi), it follows that there exists some ig for which 
<P(Qi 0 ) = 1/ while cj)(Qi) = 0 for i i 0 . 

If i f Zg, then (12. Il l implies 

MQM < f(Q*Qi)ip( R * R ) = f(Qi)f( R * R ) = o, 

and similarly f(RQi) — 0 for all R G B{1 2 ). Therefore 

<P(T) = E E HQiTQj) = HQiJQio)- 

i=ij=i 

But the projections Q; have been chosen such as to have ||Q/ 0 TQ, 0 || < e||T||, so 

MT)\ < HQto'TQ.oll <e||T||. 

Since this is true for any e > 0, it follows that ip(T) = 0, and the proposition is proved. □ 
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3. Intermezzo: what we will do next and why 

3.1. General plan. As noted above, we intend to prove the Paving Conjecture. The proof will lead 
us on an unexpected path, so we explain here its main steps. 

The Paving Conjecture asks us to find, for a given matrix T, diagonal projections Q, that achieve 
certain norm estimates (namely, ||Q;TQ;|| < e||T||). Among the different ways to estimate the norm, 
the proof in |10j choses a rather unusual one: it uses the fact that the norm of a positive operator is its 
largest eigenvalue. So we have to consider characteristic polynomials of matrices—in fact, the largest 
part of the proof is dedicated to estimating roots of such polynomials. (Although it has nothing to 
do with (KS), one should note the added benefit that we find a way to control with no extra effort all 
eigenvalues of the matrix, not only the largest one.) 

On the other hand, to achieve this control we need to make an unexpected detour: though the 
characteristic polynomial depends on a single variable, in order to control it one has to go through 
multivariable polynomials and to use the theory of real stable polynomials as developed by Borcea 
and Branden [2J. This may seem unnatural, but it should be mentioned that Borcea and Branden 
have already obtained through their methods spectacular results, in particular solving long-standing 
conjectures in matrix theory that also seemed at first sight to involve just a single complex variable [2] 
[3]. So maybe one should not be so surprised after all. 

A second feature of the proof is its use, at some point, of a random space. After obtaining certain 
results about eigenvalues of usual matrices, suddenly random matrices appear on the scene. In fact, 
the use of randomness is not really essential; it rather provides a convenient notation for computing 
averages. As noted in the previous section, to prove (PC) we need to find a partition of a finite set 
{1,..., m} into r subsets with certain properties. The random space eventually considered is finite; 
its elements are all different such partitions, and no subtle probability is used: all decompositions are 
assumed to be equally probable. What we will achieve eventually is an estimate on the average of 
the largest eigenvalue, which will lead to an individual estimate for at least one point of the random 
space—that is, for one partition. This will be the desired partition. 

3.2. Sketch of the proof. We summarize here the development of the proof. As announced above, 
we intend to discuss the eigenvalues of positive matrices, which are roots of the characteristic poly¬ 
nomial. So we need some preparation concerning polynomials and their roots; this is done first in 
one variable in Section 0] The main result here is Theorem 14.91 that shows that certain families of 
polynomials have roots that behave unexpectedly well with respect to averages. This will be used in 
Section[7]to link eigenvalues of random matrices to their averages. 

But we have to go to polynomials in several variables, namely real stable polynomials, which are 
defined by a condition on their roots. Section[5]is dedicated to real stable polynomials; after present¬ 
ing their main properties, we are especially interested in some delicate estimate on the location of the 
roots, which is done through an associated function called the barrier function. The properties of the 
barrier function represent the most technical and not very transparent part of the proof. The main 
thing to be used in the sequel is Theorem 15.81 that puts some restriction on the roots of a real stable 
polynomial. 
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We apply these facts to characteristic polynomials in Section [6] The voyage through several vari¬ 
ables done for polynomials has a correspondent here in the introduction of the mixed characteristic 
polynomial, which depends on several matrices. It happens to be the restriction to one variable of a 
real stable polynomial, and so Theorem l5.8l can be used in Theorem l6.4l to bound the roots of a mixed 
characteristic polynomial. Further, this bound translates in a bound for a usual characteristic poly¬ 
nomial in the particular case when the matrices have rank one, since then the mixed characteristic 
polynomial is precisely the characteristic polynomial of their sum. 

Section[7]introduces random matrices; as discussed above, the probability space in view is that of 
all possible partitions. The main result, Theorem 17.21 uses the results of Section 0] to show that for a 
sum of independent random matrices of rank one, the eigenvalues of its average yield estimates for 
the averages of its eigenvalues, and thus for the eigenvalues of at least one point of the probability 
space. In particular, applying this fact in conjunction with the bound on eigenvalues obtained in 
Section [6j we will obtain a partition with certain norm properties in Theorem 17.51 

Finally, this last fact is put to good use in Section[8]to obtain a proof of the Paving Conjecture. The 
first step, that uses Theorem l7.5i obtains for orthogonal projections a quantitative version of (PC). To 
go from projections to general operators is well known since several decades and may be done in 
different ways. Here we use a dilation argument taken from (15] to obtain the Paving Conjecture for 
selfadjoint matrices; going to general matrices is then immediate. 

4. Analytic functions and univariate polynomials 

4.1. Preliminaries. The next theorem in complex function theory is a consequence of Cauchy's ar¬ 
gument principle. 

Theorem 4.1. Suppose ( f„) is a sequence of analytic functions on a domain D C C, which converges uni¬ 
formly on compacts to the function f ^ 0. IfT is a simple contour contained in D such that f has no zeros on 
F, then there is «o £ N such for n > n$ the number of zeros of f n and off in the interior ofT coincide. 

The next corollary is usually called Hurwitz's Theorem if m — 1. The general case follows simply 
by induction (exercise!). 

Corollary 4.2. Suppose p n (z\,...,z m ) are polynomials in m variables, such that p n —> p uniformly on 
compacts in some domain D C C m . If p m has no zeros in D for all m, then either p is identically zero, or it has 
no zeros in D. 

If / is a polynomial of degree n with all coefficients and all roots real, we denote its roots by 

I °n(f) <■ <Pl(/)- 

Corollary 4.3. Suppose p s (z) — Ya= l a ii s ) z ‘ / with Ui : I —> R continuous functions on an interval I C IR, 
a„ (s) 0 on I. If p s has real roots for all s€l, then the roots pi(p s ), ..., p„ (p s ) are continuous functions of 

s el. 

Proof. We use induction with respect to n. The case n — 1 is obvious. Then, for a general n, we prove 
first that p\(p s ) is continuous, say in sq e I. Take e > 0, and suppose also that p So ( s o ± e) 0. By 
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continuity of a,, p s (so ± e) for s sufficiently close to sq, and so p s (z) 7 ^ 0 for z on the circle T of 
diameter [sq — e, Sq + e] (since all p s have real roots). By Theorem 14.11 all p s have at least one root 


inside T for s sufficiently close to Sg. A similar argument, using a circle at the right of sq + e, shows 


that the p s have no roots larger than b. ft follows that pi(p s ) £ (a, b) for s close to So- 

If we write now p s (z) — (z — pi(p s ))q s (z), then q s has degree n — 1 and continuous coefficients, 
so its roots are continuous by the induction hypothesis. But we have Pi(p) — pi-\(q) for i > 2. □ 

Remark 4.4. Even without the assumption that the roots are real, one can prove that there exist 
continuous functions p,(s) : I —> C, i — 1,..., n, such that the roots of p s are pi(s),... ,p„(s) for all 
s £ I. The proof is more involved; see, for instance, ( 8 } II.5.2]. 

We prove next two lemmas about polynomials with real coefficients and real roots. 

Lemma 4.5. Suppose the polynomial p of degree n has real coefficients, real roots, and the leading term 
positive. Moreover, assume that there exist real numbers a n+ \ < a n < ■ ■ ■ < a\ such that Pj(p) £ [fly+i,fly] 
for all j — 1 Then (—1 )i~ 1 p(aj) > 0 for all j = 1 

In other words, p changes signs (not necessarily strictly) on each of the intervals [a; + j,aj\. 

Proof. We will use induction with respect to n. For n — 1 the claim is obviously true. Suppose it is 
true up to n — 1, and let p be a polynomial of degree n as in the statement of the lemma. There are 
two cases to consider. 

Suppose first that the roots of p are exactly all points fly except some aj Q . Then p has only simple 
roots, so it changes signs in each of them. As p(x) > 0 for x > a\, we have p(x) < 0 on ( 02 , fli), 
etc, up to (—1 )i°~ 1 p(x) > 0 on (fly 0 + i,fly 0 _i). Therefore ( — 1 )i°~ 1 p{cij 0 ) > 0; the other inequalities are 
trivial. 

In the remaining case, there is at least one root a of p that is not among the points ay, suppose 
oc £ ( aj 0 ,aj 0 _i ). If p(z) — (z — a)q(z), then q has degree n — 1 and satisfies the hypotheses of the 
lemma with respect to the points fly with j 7 ^ 0. Then p{af) has the same sign as q(af for j < jo and 
opposite sign for j > jo; from here it follows easily that the correct signs for q (which we know true 
by the induction hypothesis) produce the correct signs for p. □ 

Lemma 4.6. Suppose the polynomial p has real coefficients and all roots real. Then 



for fllheN and x > pi(p). 


In particular, y is positive, nonincreasing, and convex for x > pi(p). 




All terms in the last sum are positive for x > p] (p), so the lemma is proved. 


□ 




THE SOLUTION TO THE KADISON-SINGER PROBLEM 


9 


4.2. Nice families. Suppose ? — {f\,... f m } is a family of polynomials of the same degree n. We 
denote 

Pt (?) := max p j (f i ), pf(?) := min Pj (fi) 

1 1 <i<m 1 1 <i<m 

Definition 4.7. For a family of polynomials F" — {/i,... f m } of the same degree n a nice family iff: 

( 1 ) the coefficient of the dominant term of every fj is positive; 

( 2 ) every fj has all roots real; 

(3) for all j — 2.,n we have 

(4-1) pf(?)<pT_ x (?). 

The usual formulation (including lUOl ) is that the /,s have a common interlacing. Since the actual 
interlacing polynomial never enters our picture, we prefer this simpler phrasing. 

Lemma 4.8. (i) {/i,.. ./,„} is nice iff every pair {f r ,fs}, r s, is nice. 

(ii) Every subfamily of a nice family is nice. 

(iii) If a e 1R, then ? — {f\,.. .f m } is nice if and only ifQ — {(x — a)f\,... (x — a)f m } is nice. 

Proof, (i) and (ii) are immediate. For (iii), there are several cases to consider: 

(1) If a € [pf o (?),pf(?)} for some j 0 , then 

pf(G) = pfi?) f or; < jo, 

ptw - pi (n pug) - pU(o) i, P - +l (Q) P r(D, 

pf(G) = pf_ r (?) for; > j 0 + 1 . 

(2) If a e (p^(?),pj Q _ x (?)) for some j 0/ then 

pf(G) = pf{F) for; < jo, 
pf 0 (G) = a, 

pf(G) = pf-i(F) for; > j 0 - 

The formulas in (1) are also valid if a > pjj (?) (taking jo = 1) or a < pf (?) (taking jo — n + 1). In 
all these cases one can easily check that (iii) is true. □ 

As a consequence of Lemma [4.81 in order to check that a family is nice we can always assume that 
it has no common zeros. 

The main theorem of this section is the characterization of nice families that follows. 

Theorem 4.9. Suppose f\,.. .,f m are all polynomials of degree n, with positive dominant coefficients. The 
following are equivalent: 

(1) ? — {fi,... f m } is a nice family. 

(2) Any convex combination of f\,.. .,f m has only real roots. 
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If these conditions are satisfied, then for any j — 1 ,...,nwe have 
(4.2) minpj(fi) < pff) < maxp ; (/;) 

for any convex combination f — Ylk hfk- 

Proof. (1) ==>- (2). We may suppose by (ii) and (iii) of Lemma [4.81 that all coefficients tj { are positive 
and that the family has no common zeros. In particular, if we denote p± — p'p(J-), this implies 
Iof <iof <p~_ x f°r ally. 

We will apply Lemma l4.5l to each of the polynomials /, and the points pZj < pf_ 1 < • • • < pf < pj. 
We obtain then, for each i — 1,..., m, that ( —1 Jfi(pj) > 0 for all j, and f(pj ) > 0. 

Fix j ; since the family J- has no common zero, at least one of /,; is nonzero in pj ~, and so (— 1 Jf(p J ) > 
0. Similarly, f(pj ) > 0. Therefore on each of the intervals (pj, pJ_J), as well as on (pf,pj), f changes 
sign (strictly), and therefore must have a root in the interior. Since there are n intervals, we have thus 
found n roots of /, and so all its roots are real. Moreover, we have obtained Pj(f) > pj for all j. 

On the other hand, we might have used, in applying Lemma [4.51 to the polynomials /„ the points 
Pn < Pn < pt-1 ■ < pj instead of p n < p j; | < • • • < p l < pj. A similar argument yields then 
pj (/) < pj for all j. Therefore the inequalities (14.2b are proved. 

(2) => (1). According to Lemma [4.8l it is enough to prove the implication for two functions fi/fit 
and we may also suppose that they have no common roots. Fix 2 < j < n; we have to prove that 
pj < pj_ v Denote ft — tf\ + (1 — f )/2 (0 < t < 1). By Corollary 14.31 the function t i —> Pj(ft) is 
continuous on [0,1] and takes only real values; so its values for 0 < t < 1 cover the interval (p ~, pj ). 

It follows that this interval cannot contain a root of either f\ or / 2 , since a common root of, say, f\ and 
ft is also a root of /y. 

Suppose then first that f\ and fi have only simple roots. Then the intervals [pj, pj] and [pj~_ v pf _, J 
have all four endpoints disjoint, and by definition PJ < PJ. v If Pj_i S (Pj ,pj )/ this would contra¬ 
dict the conclusion of the preceding paragraph. So PJ-1 > Pj and 14.1 b is proved. 

To obtain the general case, note first that ft has all roots simple for 0 < t < 1. Indeed, a multiple 
solution x of ft — 0 would also be a multiple solution of jj — jfj. But it is easy to see (draw the 
graph!) that then --J — -J—j has a single root in some interval (x — e, x + e) for at least some t' close 
to t (slightly larger or slightly smaller). However, from Theorem [4J] it follows that / f / has more than 
one root in the disc |z — x\ < e, and so f t / would not have all roots real. 

To end the proof, we apply the first step to f £ and /i_ e (e > 0), which have only simple roots. 
Then we let e —» 0 and use Corollary 14. 31 to obtain inequality d4.Hl . □ 

5. Several variables: real stable polynomials 
5.1. General facts. Denote H - {z e C : 9z > 0). 

Definition 5.1. A polynomial p(zi,... ,z m ) is called real stable it has real coefficients and it has no 
zeros in H'". 

In case m — la real stable polynomial is a polynomial that has real coefficients and real zeros. 
Genuine examples in several variables are produced by the next lemma. 
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Lemma 5.2. If A \,..., A m G M d (C) are positive matrices, then the polynomial 

m 

(5.1) q(z,zi,...,z m ) = det(zl rf + Y j z i A i ) 

i=i 

is real stable. 

Proof. It is immediate from the definition that 

q(z,z lf ... ,z,ii) — q(z,z\,..., Zm'), 
whence the coefficients of q are real. 

Assume that q(z, Z\,...,z m ) — 0, and 3z, Sz, > 0. Since zl d + Y 4 L 1 z iAi i s not invertible, there 
exists £ G C d , £ 7 ^ 0, such that 

m m 

0 = ((zl d + Y^ZiAi)£,£) = z ||£|| 2 + Y^Zi(A£,£), 

i=l i=l 

and so 

m 

0 = 9z||£|| 2 + X> ; (A ; £,£). 

i=i 

This is a contradiction, since Sz ||£|| 2 > 0 and Sz;(A,£, £) > 0 for all i. □ 

The next theorem gives the basic properties of real stable polynomials. Denote, for simplicity, by 
dj the partial derivative jf-. 

Theorem 5.3. Suppose p is a real stable polynomial. 

(i) If m > 1 and t G IR, then p(z ..., z m _i, t ) is either real stable or identically zero. 

(ii) Ift G IR, then (1 + td m )p is real stable. 

Proof, (i) Obviously p(zi, ...,z m _i,f) has real coefficients. Suppose it is not identically zero. If 
Sip > 0 is fixed, then the polynomial p(z 1 ,...,z m _ i, w) is real stable by definition. Therefore all 
polynomials p(zi,.. .,z m _i,f+(for n G N) are real stable. We let then n —t 00 and apply Corol- 
larv l4.2l to D = H m to obtain the desired result. 

(ii) We may assume t f 0 (otherwise there is nothing to prove). Suppose (1 + td m )p(z\,... ,z m ) — 
0 for some [z\,... ,z m ) G H m . Since p is real stable, p(zi,.. ■ ,z m ) 7 ^ 0. The one-variable polynomial 
q(z) p(z 1 ,... ,z m _i,z) has no roots with positive imaginary part (in particular, q(z m ) 7 ^ 0 ), so we 
may write 

n 

q(z) — c[](z — wf), < 0 . 

i =1 

Therefore 

0 = (l + fd w )p(zi,...,z m ) = (q + tq')(z m ) = q(z m ) ^1 + j , 

and, since q(z nl ) 7 ^ 0 , 

Jh 1 A Z m — Wj 

o = i + tV -= i + *E —W- 

f-^Zm-Wi *~[\z m -Wif 

Taking the imaginary part, we obtain 
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which is a contradiction, since t 7 ^ 0 and Ssw; — 3z m < 0 for all i. □ 

We will also need a lemma that uses a standard result in algebraic geometry, namely Bezout's 
Theorem (which can be found in any standard text). 

Lemma 5.4. Suppose p(z,w) is a nonconstant polynomial in two variables, of degree n in w, which is irre¬ 
ducible over 1R. There is a finite set F £ C such that, if p(zo, w 0 ) = 0 and zo F, then: 

( 1 ) the equation p( zo, w) — 0 has n distinct solutions; 

( 2 ) for each of these solutions (zo, w 0 ) we have (zo, w) 7 ^ 0 . 

Proof. First, if p(z, w) — q(z)zv n + ..., then the roots of q form a finite set F\. 

Secondly, if p is irreducible, then p and are coprime over ]R, and hence also over C. Bezout's 
Theorem in algebraic geometry states that two curves defined by coprime equations have only a finite 
number of common points, so this is true about the sets defined by p(z, w) — 0 and (z, w) — 0. Let 
F 2 be the set of the projections of these points onto the first coordinate. The set F = F| U F 2 has the 
required properties. □ 

5.2. The barrier function. Our eventual purpose in this subsection is to obtain estimates on the roots 
of real stable polynomials; more precisely, we want to show that a restriction on the roots of a real 
stable polynomial p may imply a restriction on the roots of (1 — dfjp (which is also real stable by 
Theorem l5.3ll . 

We will often use the restriction of a polynomial in m complex variables to IR m C C m . To make 
things easier to follow, we will be consistent in this subsection with the following notation: z, w 
will belong to C m (and corresponding subscripted letters in C), while x,y,s,t will be in ]R'" (and 
corresponding subscripted letters in IR). If x — (x\,... ,x m ) £ IR'”, then { 1 / > x} will denote {y — 
(yi, .. ■ ,y m ) £ lR m : y; > x, for all i — 1 ,... ,m}. 

The main tool is a certain function associated to p called the barrier function, whose one-dimensional 
version has already been met in Lemma [4.61 It is defined wherever p 7 ^ 0 by Oj, = if p(x) > 0 
it can also be written as df(x) — rJ,(log p)(xj. The argument of the barrier function will always 
actually be in IR m . 

The connection of the barrier function with our problem is given by the simple observation that if 
p(x) 7 ^ 0 and (1 — 3 ; )p(x) = 0, then O^x) = 1. So, in particular, a set on which 0 < % < 1 does 
not contain zeros of (1 — 5,)p. To determine such sets, the basic result is the next lemma, which is a 
multidimensional extension of Lemma [4.61 

Lemma 5.5. Suppose x G IR'", and p(z\,.. .,z m ) is a real stable polynomial that has no roots in {y > x}, 
then 

for any k> 0,1 <i,j< m, and x' > x. 

In particular, ifej is one of the canonical basis vectors in C m , then 1 1 —t T>^(x + tef) is positive, nonincreas¬ 
ing and convex on [ 0 , 00]. 
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Proof. The assertion reduces to Lemma l46l for m — 1 or for k — 0; and also for i = j, since then fixing 
all variables except the zth reduces the problem to the one variable case. 

In the general case, it is enough to do it for y — x, since if p has no roots in {y > x}, then it has 
no roots in {y > x'j for all x' > x. By fixing all variables except i and j, we may assume that m — 2, 
i — 1, j — 2, k > 1. Moreover, we may also assume that p > 0 on {y > x} (otherwise we work with 
— p, since Op — 3>l p ). 

So we have to prove that, if p(zi,Z 2 ) is a real stable polynomial which has no zeros in {y\ > 
x i> 3/2 > X 2 }, then 

0 - ( “ 1 )fc ^f^ (xi '* 2) = ( ~ 1 )k b 2 (a^ log?7 ) (X1,X2) = ^ ( { ~ 1 )k b 2 l ° SP ) (xi,X2) ‘ 

We will in fact prove that the map 

t (- 1 ) k ^k l ogp(t,x 2 ) 

is increasing for t > x\. It is enough to achieve this for p irreducible over 1R, since, if p — P 1 P 2 is real 
stable and has no roots in {y > x}, then the same is true for p\ and p 2 , and obviously 

( _1 ) fc ^ log P( t,X2 ) = iogP^t/* 2 ) + (-l)^logp 2 (b* 2 ). 

Suppose then that p is irreducible. For t > x\ fixed, the polynomial p(f, z) is real stable, and thus 
has all roots real; denote them, as in SectionlH by p\ (!)>■■■> p,,(t). 

Applying to p Lcmma l5.4l take t > X\ that does not belong to the finite set F therein. The functions 
Pi(t) are therefore differentiable in f, and we have 


(5.2) 

Therefore 

I 

(5.3) 


p(f,z) - c{t)Y[{z - pi{t)) 

i=i 


d k 


d k 


d k 


(-!) ^r log P ] ((-!) ^jt lo gp)(*/*2) = (-1 ( X>g( z -i°i(0) 


3z: 


d z 2 \i =i 

f (k-l )! 

fe (■ X 2-Pi(t )) k ‘ 


Z=X2 


If f > xi, we cannot have p,(f) > x '2 since then (f,p,(t)) would be a root of p in {y > x}, contrary to 
the assumption. Thus X 2 — p,(f) > 0, and in order to show that the function in (15.3b is increasing, it 
is enough to show that f i —> pi(t) is decreasing for t > x\ and all i. 

Now all pis are differentiable for f > xi, f F. To show that they are decreasing, it is enough 
to show that p\{t) < 0 for such f. Suppose then that there exists i G {1,...,«} and t > X\ such 
that p'ft) > 0; let s = p;(t). Since ^(t,s) ^ 0, we may apply the (complex) implicit function 
theorem in a neighborhood of (f,s) (in C 2 ). We obtain that the solutions of p(z\,z 2 ) — 0 therein are 
of the form (zi,g(zi)) for some locally defined analytic function of one variable g, which by analytic 
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continuation has to be an extension of p to a complex neighborhood of t. So g'(t) — p'At), and in the 
neighborhood of f we have 

g(z 1 ) -f + p-(t)(zi-t) + 0(|zi-t| 2 ). 

If 3zi > 0 and small, one also has Sg(zi) > 0. We obtain thus the zero (zi,g(zi)) of p in H 2 , 
contradicting the real stability of p. This ends the proof of the lemma. □ 

Corollary 5.6. Suppose x G K m , and p is a real stable polynomial, without zeros in {y > x}. Then Op (y) < 
Op (x) for any y > x and j — 1,..., m. 

Proof. If p has no zeros in {y > x\, obviously it has no zeros in {y > x'\ for any x' > x. Therefore, 
by Lemma [5.51 the function t *—? Oj, (x 1 + tef) is nonincreasing on [0, oo) for any i — 1 ,,m. We have 
then 

O j p(xi ,...,X m ) > Op(y!,X2/ ■ • - ,X m ) > &p(yi,y2,X3,-■ -fX m ) >■■ > O', (\J X ,..., y m ) □ 

The main monotonicity and convexity properties of Op are put to work in the next lemma to obtain 
a restriction on the location of zeros of (1 — dj)p. As noted above, we will use the condition Oj, < 1, 
but in a more precise variant which will lends itself to iteration. 


Lemma 5.7. Let x G ]R'”, and p a real stable polynomial, zvithout zeros in {y > x}. Suppose also that 

^p( x ) + ^ < 1 

for some j G {1 .,m} and S > 0. 

Then: 

(i) (1 — dj)phas no zeros in {y > x}. 

(ii) For any i — 1,..., m we have 


<1 > ( 1 _30p(* + *y) < 


Proof. By Corollarv l5.6l we have 

= "W < t(l ) < 1 A < !, 

so djp(y) A P(V), or (1 - dj)p{y) A 0. 

To prove (ii), note first that (1 — dfp — p( 1 — Op), whence log[(l — dfp] — log p + log(l — Op), 
so, by differentiating, 

, t ,' _ j,/ _ 


The required inequality becomes then 

3/0 ; p(x + Sej) 


(5.4) 


By Corollarv l5.6l we have 


1 - Op(x + Sej) 


< 0 ! p(x) - <P‘ p (x + Sej) 


oj,(x + Sej ) < oj,(x) < 1 - 
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or 


< 6. 


1 -<J >* p (x + 6ej) 

Further on, p has no zeros in {y > x + Sej }, so Lemma 15.51 (a p p lied in x + Sej) implies, in particular, 
that —di<&p(x + Sej) > 0, whence 

9;Op(x + <5ey) _ -j , . % 

- t --— < — 5di<& v (x + Sej). 

1 — & p (x + 6ej) 

To prove 15.4b . it is then enough to show that 

—5di&p(x + Sej) < <E>p(x) — ^(x + Sej). 

Using dj<&p(x + Sej) — 3y<3T(x + Sej), the inequality can be written 

%(x + Sej) < Oj/x) + Sdj<t>p (x + Sej). 

This, however, is an immediate consequence of the convexity of the function f i —> <J>), fx + fey), that 
has been proved in Lemma l5.5l □ 

Finally, the next theorem is the main result of this section that we will use in the sequel. 


Theorem 5.8. Let x e ]R m , and p a real stable polynomial, without zeros in {y > x}. Suppose also that 

2 

(-U/ • • ■ / Xm) T — ^ 1 

/or some S > 0 ond j — 1 ,...,m. Then 

m 

n a-aop 

i=l 

has no zeros in {y > x + <5}, ro/zere I := (S,...,S) € 1R” ! . 

Proof. The proof follows by applying Lemma 15.71 successively to ) = 1 and x, then to j — 2 and 
x + Se i, etc. □ 


6. Characteristic and mixed characteristic polynomials 

6.1. Mixed characteristic polynomial. We intend now to apply the results of Section[5]to polynomi¬ 
als related to matrices. Our final goal is to estimates eigenvalues; that is, roots of the characteristic 
polynomial. But we will first consider another polynomial, attached to a tuple of matrices. 


Definition 6.1. If A\,..., A m G Mrf(C), then the mixed characteristic polynomial of the matrices A, is 
defined by the formula 


( 6 . 1 ) 


y[A 1 ,...,A m ](z) -a *') det ( zI d + Yj z i A >) 

i=l i=l 


’— z m —0 


It is easily seen that if we fix m — 1 of the matrices A\,..., A m , then }t[Ai ,..., A m ] (z) is of degree 
1 in the entries of the remaining matrix. Indeed, if we develop the determinant that enters J6.1b . then 
any term that contains a product of, say, k entries of Aj has also the factor zjr If we apply (1 — dj), we 


are left with J) 1 , and if k > 2 this terms becomes 0 if z,- = 0 . 
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Example 6.2. For one or two matrices we have 

}i[Ai\(z) — z d — Tr A 1 ifm — 1, 

p[A 1 ,A 1 \(z) = z d -z d ~ 1 (TrA 1 +TrA 2 ) + z rf ~ 2 (Tr A a Tr A 2 -Tr(AiA 2 )) if m = 2. 

In the general case, the coefficients of p[A \,..., A,„](z) are certain expressions in the traces of 
monomials in A \,..., A m that are well known in the invariant theory of matrices (see [13]). 

The results in Section[5]have consequences for the mixed characteristic polynomials. 

Theorem 6.3. Suppose A \,..., A m G Mj(C) are positive matrices. Then p[A \,..., A m ] (z) has only real 
roots. 

Proof. We have seen in Lemma l5.2l that the polynomial q defined by (15.1b is real stable. But p [A |, ... ,A m 
is obtained from q by first applying (1 — 3,) for i — 1 ,m and then specializing to z\ — ■ ■ ■ — z m — 

0. By Theorem l5.31 these operations preserve the real stable character. So p\A\,..., A m ] is a real stable 
polynomial of one variable, which means exactly that it has real roots. □ 

Remember Jacobi's formula for the derivative of the determinant of an invertible matrix: 

< 6 - 2) {J ^rnf= TrjMW-'M'W). 

Theorem 6.4. Suppose A \,..., A m e Mrf(C) are positive matrices, such that Ya=\ — Id an d Tr A, < e 
for each i = 1,..., m. Then any root ofp[A \,..., A m ] is smaller than (1 + y/e) 2 . 

Proof. The polynomial 

m 

p{z) := det(X>iA). 
i=1 

is real stable, being the specialization of the polynomial q in ( 15.111 to z = 0. If t > 0 and t 
(t,...,t) e C d , then, for y > t we have Yh=\ 3 h^i > T 4 L 1 tA i — ^d- Therefore YJlLi ViA, is invertible, 
and p{y) ^ 0. 

We may apply Jacobi's formula 16.211 in order to compute the barrier function Oj,, and we obtain 

m 

(*!,..., x m ) = Tr ((J2 z i A i)~ lA j). 

i=l 

In particular, if t > 0, then 

<J>j,(f,...,f) = Tr (r'Aj) < y. 

It follows then from Theorem 15.81 that, if we t, S > 0 are such that f + y < 1, then ]~J”L 1 (1 — 3, )p has 
no zeros in { 1 / > (t + S,... ,t + 5)}. The choice t — e + \fc, (5 = 1 + \[e (which can easily be shown 
to be optimal) yields t + S — (1 + \fe) 2 , and therefore p has no roots y with i/ ; > (1 + \fe) 2 } for all i. 
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Now, using the relation Ya=\ A i — 1/ one obtains 

m m 

y[A 1 ,...,A m ](z) = n(! “ d et(zl rf + E Z A/)|, = , m=0 

i=l i=l 

m m 

- 11(1 - 3,0 det(£ Wi Ai) | =Wm=z 
i=l i=l 

m 

= U(l-9i)p{z,z .z), 

i=i 

which cannot be zero if z > (1 + \A) 2 - Therefore all roots of }t are smaller than (1 + y/e) 2 . 


□ 


6.2. Decomposing in rank one matrices and the characteristic polynomial. In an important par¬ 
ticular case the mixed characteristic polynomial coincides with a usual characteristic polynomial. 
Remember this is defined, for A e M d ( C), by p A (z) = det{zl d - A). 


Lemma 6.5. Suppose B, A\, ..., A m e M d (C), and A\, ..., A m have rank one. Then the polynomial 

(zi,... ,z m ) i y det(B + Z\A\ + • • ■ + z m A m ) 
is of degree < 1 separately in each variable. 


Proof. By fixing all the variables except one, we have to show that, for any B, A\ € M d (C), A\ of rank 
one, the function 

z i-t det(B + zAj) 

is of degree at most 1. This is obvious if we choose a basis in which the first vector spans the image 
of A\, and we develop the determinant with respect to the first row. □ 


Suppose now p(z\,... ,z m ) is a polynomial of degree < 1 separately in each variable. Then p is 
equal to its Taylor expansion at the origin of order 1 in each variable, that is: 


p(z 1 ,...,Z m ) = Yj c ei,... 


. . . 7 e tn 

f 


with 


Therefore 


e i€{ 0 /l} 

Ce l.*» = ^ ' --^TpizVl,...,ZV,n)\ Wl = ... =Wm=0 - 

p{z 1 ,...,Z m ) = Y Z 1 ' " Z m d l 1 ■ ■■ d mP(Wl,---,W m )\ Wl= ... =Wm=0 


e;€{ 0 ,l} 


= n(l + z i 3f)p(a;i,...,Wm)| Wl= ... =WB=0 . 

i=l 

In the case of the polynomial in Lemma [6.51 this formula becomes 

m m m 

det(B + Y z i A i) = IK 1 + z &) det ( B + E ^i A i)L = ... =Wm=0 - 

i= 1 1=1 i=l 

In fact, we are interested by this last formula precisely when B — zl d and all z, = —1. We obtain 
then the next theorem. 
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Theorem 6.6. Suppose A \,..., A m £ M t f(C) have rank one. If A — A\ + • ■ • + A m , then 

Pa(z) = p[A l ,...,A m ](z). 

Remark 6.7. The mixed characteristic polynomial and the usual characteristic polynomial are invari¬ 
ant with respect to a change of basis. So, although we have spoken about matrices for convenience, 
the statements of Theorems 16.41 and 16.61 can be stated for A \,..., A m £ £(V), where £(V) denotes 
the space of linear operators on the finite dimensional vector space V. 

7. Randomisation 

7.1. Random matrices and determinants. Let (fi, p) be a finite probability space. If X is a random 
variable on Q, the expectation (or average) of E(X) is defined, as usually, by 

E(X) := £ p(to)X(w). 

wed 

If X \,..., X m are independent random variables, then, in particular, we have 

(7.1) E(X 1 ---X m ) =E(X 1 )...E(X m ). 

We will use random matrices A(oi) £ Mj( C), whose entries are random variables; then E(A) is 
the matrix whose entries are the expectations of the corresponding entries of A. The random matrices 
Ai, A 2 are called independent if any entry of A | is independent of every entry of A 2 . Also, when we 
say that a random matrix A(<u) has rank one, this means that A (ay) has rank one for any cv £ Cl. 

The characteristic polynomial p a of a random matrix A is also a random variable, by which we 
mean that its coeficients are random variables. Then the polynomial E(p^) has as coefficients the 
expectations of the coefficients of A. 

Theorem 7.1. Suppose Ai(a;),..., A m (a;) are independent rank one random matrices in M^(C), and A — 
Ai + • • • + A m . Then 

E(Pa) = ^[E( A i),... ,1E(A,„)]. 

Proof. By Theorem l6.6l we have, for each co £ Cl, p = p[Ai(co) r .. ,,A m (cv)]. By taking expecta¬ 
tions, 

E(p A ) = E(p[Ai(a;),.. .,A m (a7)]). 

Now independence of A,s combined with the multilinearity of U implies that 

E(p[Ax(a;),...,A,„(a;)]) = p[E(Ai),.. ,,E(A m )], 

which ends the proof. □ 

We can say more if we also assume that the A,s are all positive. 

Theorem 7.2. Suppose Ai(a>),..., A m (a;) are independent rank one positive random matrices in Md(C), 
and A — Aj H-1- A m . Then, for any j = 1,..., d, we have 


min Pj(p AM ) < |0/(p[E(A 1 ),...,E(A ffl )]) < max Pj (p A{w) ). 
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(7.2) 


min / 

coECl 


Proof. We prove only the left hand side inequality; the right hand side is similar. It is enough to show 
that for any i — 1 ,... we have 

vpjfulAifw),..., A,-_i(o;),A / (a;),E(A,- +1 ),.. .,E(A m )]) 

< minm(^[Ai(a;),...,A ; _ 1 (a;),E(A ; ),E(A; + 1 ),...,E(A m )]). 
wen 

Indeed, for i — in the left hand side coincides with min^gQ Pj(PA(w)) by Theorem 16.61 while for i — 1 
the right hand side is precisely Pj(p[ E(Ai),... ,E(A m )]). The chain of inequalities corresponding to 
i — 1,2,...,m proves then the theorem. 

Fix then i and co £ Q, and consider the family of polynomials 

fw> = f[ a i(w), ..., A/_!(a’) ; A/(m , ) / E(A, +1 ),.. .,E(A m )], a/ <E Cl. 

Take c w i > 0 , with Y,w'en c w' — 1- By the multilinearity of the mixed characteristic polynomial, we 
have 

E c w'fw' = E c^[Ai(aO,...,A / _ 1 (w),A;(a/),E(A i+1 ),...,E(A„ i )] 


a/EO 


cj'eCI 


= }i[A 1 (co),...,A i _ 1 (co), E c a ,/A,(a/),E(A i - + 1 ),...,E(A„ i )]- 

a/EO 

Since the last polynomial is the mixed characterstic polynomial of positive matrices, it has all roots 
real by Theorem 16.31 It follows by Theorem 14.91 that {f w i : co' € Q} is a nice family. Moreover, if 
we take as coefficients of the convex combination c w i — p (co') and so for any j — 1,... ,d we have 
Eu/en c a ,/A;(m / ) = E(A,). Applying the last part of Theorem 14. 9l it follows that for any j = I ,..., d, 

min pj(}t [Ai (co),..., A A ; (a/),E(A i+1 ),..., E(A,„)]) 

w'en ’ 

Pj(p [Ai (co),..., A~i-i(co), E(A f ), E(A ;+1 ),... ,E(A m )]). 

Taking the minimum with respect to co G Q, we obtain 


(7.3) 


min min pfji[Ai(co),..., Ai_ x (co), A,(a/),E(A, + i),..., E(A,„)]) 
wedaj'sn ; 


minpj(p[A 1 (co ),..., A / _ 1 (a;),E(A / ),E(A /+1 ),.. .,E(A„,)]). 
wen 

Suppose the minimum in the left hand side is attained in co — ay, co' — co g. By independence of the 
random matrices A,, we have 

p({e-G Q : A a (cr) = A^oy),..., A;_ a (cr) = A/.^oy), A;(cr) = A^)}) 

p({cre Q : Ai(o-) = A 1 (a; 0 ),...,A / _ 1 (cr) = A;_i(ay)})p({cr G O : A/(<r) = A ( a^)}) > 0. 

Taking Pq S Q in the set in the left hand side, we obtain 

minpd^A! (o),..., A,-_ a (o), A i (cr) / E(A i+1 ),... ,E(A W )]) 
oe n ‘ 

< Pj(} l [A\(cro ),..., A,_ 1 (cro), A,(po),E(A; + i),. . .,E(A m )]) 

= |0/(/'[A-iO’o),. • ■, A,_i(ay), A,(a;o),E(A ;+1 ),.. .,E(A,„)]) 

= min min m(^[Ai(a;),..., A,_i(a>), Aj(co'), E(A m ),...,E(A m )]). 
weaken 1 

This inequality, together with Il7.3b . implies 117.21 . finishing thus the proof of the theorem. □ 
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Remark 7.3. The point of Theorem 17.21 is that the middle term might be easier to compute or to 
estimate. But, since the matrices E(Ai),..., E(A,„) are not of rank one. Theorem 16.61 does not apply, 
and /rpE(Ai),..., E(A m )] is not a characteristic polynomial. However, Theorem 17.21 tells us that its 
roots can be used to estimate the eigenvalues of A(cu) for at least some value of oj. 

Corollary 7.4. Let Ai(m),..., A m (co) be independent rank one positive random matrices in and 

A — Ai + ■ ■ ■ + A Suppose E(A) = 1^ and E(Tr A,) < efor some e > 0. Then 

min || A(w) || < (1 + \/e) 2 . 
wen 

Proof. Since Tr(E(A,)) = E(TrA,) < e, the matrices E(Ai),.. .,E(A m ) satisfy the hypotheses of 
Theorem l6.41 all roots of //[Ef Aj),... ,E(A m )] are smaller than (1 + \fe) 2 . By Theorem 17.21 vve obtain, 
in particular, 

min ||A(cu)|| = min pi{p A{w \) < pi(/*[ E (Ai),.. ,,E(A m )]) < (1 + y/e) 2 . □ 

7.2. Probability and partitions. The last theorem of this section gets us closer to the paving conjec¬ 
ture. It is here that we make the connection between the probability space and the partitions. Let us 
first note that, similarly to Remark 16.71 one can see that the independence condition is not affected 
by a change of basis. So in Theorem 17.21 and in Corollary 17.41 we may assume that A; take values in 
£(V) for some finite dimensional vector space V. This observation will be used in the proof of the 
next theorem. 


Theorem 7.5. Suppose A \,..., A m 
and 11 A,-1| < C for all i — 1 
{1 ,..., m}, such that 


G Mrf(C) are positive rank one matrices, such that YX=\Ai — h 
Then for every positive integer r there exists a partition Sj,..., S r of 


E A i 


< 



2 


for any j — 1,..., r. 


Proof. Since the purpose is to find a partition with certain properties, we will take as a random space 
O precisely the space of all partitions of {1,..., m } in r sets, with uniform probability p. Such a 
partition is determined by an element to — (a ..., cv m ), where tOj G r}, and Sj = {k : co^ — 

j}; so n — {1 ,... , r\ m . Also, the different coordinates, that is the maps co i —> a>j, are independent 
scalar random variables on Q. 

We consider the space V C d © • • • © and define the random matrices A,- (i = 1,..., m) with 
values in £(V) by 

(7.4) A,'((u) = 0 © • • • © rAj © • • ■ © 0, 

where rA, appears in position co r 

These are independent random matrices (since the coordinates cu, are independent). If we fix 
1 <j<r, then cot — j with probability 1/r, and so rAj appears in position j with probability 1 / r. 
Therefore 

1 1 

E(A/) = -rA t © • • • ® -rA { — A,- © A ; - © • • • © A,-. 
r r 
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If A — Aj + ■ ■ • + A m , then 


E(A) = E E ( A i = E( A i© A,® A ; ) = I v . 

i=l i =1 

Since Tr Afcv) — r Tr A, for all i, we have 

lE(TrA ; -) = E(rTr A,) = rE()||A;||) < rC. 
Corollary 17. 4l vields the existence of a> G fl such that 

||A(a’)|| < (1 + VrC) 2 . 

But, according to (17.4b , we have 

• • • © f r E A,-. 


A M = h' E A ; 

V Wi=l 2 


f E A * 

G "i= 2 2 


We define then Sj — {i : ay = /}. It follows that \\r YlieSj A ;| A (1 + \frC) 2 for alland dividing by 
r ends the proof of the theorem. □ 

8. Proof of the Paving Conjecture 

We may now proceed to the proof of the paving conjecture; from this point on all we need from 
the previous sections is Theorem 17.51 We first deal with orthogonal projections. For such operators 
the paving conjecture is trivially verified (exercise: if P is an orthogonal projection and diag P — 0, 
then P — 0). But we will prove a quantitative version of the paving conjecture, in which one does 
not assume zero diagonal. 

Lemma 8.1. Suppose P € M,„(C) is an orthogonal projection. For any reN there exists diagonal orthogo¬ 
nal projections Qj,..., Q r € M m ( C), with YJj=-\ Qj — hn, such that 


IIQ/PQjll < ( + V II diagP|| 


for all j — 1,, r. 


Proof. Denote by V the image of P, and d — dim V. Let (ef) r f =1 a basis in C m , and define on V the 
rank one positive operators A,- by Afv) — (v, P{ei))P{ei). We have 

II A ill < ll p (c)H 2 = (P(ef)/Ci> < 1 diagP||, 


( 8 . 1 ) 

and, for DgV, 
( 8 . 2 ) 

Consequently, 

whence 


{AiV,v) = (v,P(ei))(P(ei),v) = \(v,Pei )\ 2 = \ {v,ej )\ 2 


(E A-iV,v) = J2( A ‘ V ' V ) = E K»/ e dr = \w\\ 

i=l i=l i=l 


E A = ] V- 

i=i 


(8.3) 
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From d 8 .Hl and (18.3b it follows that Aj satisfy the hypotheses of Theorem 17.51 with C = | diagP|| c 
There exists therefore a partition Si,..., S r of {1,..., m}, such that 

2 


ieS; 


< 


1 

7r 


+ \/II dia S p llc 


for any j — 1,..., r. 


Define then Qj £ M m (C) to be the diagonal orthogonal projection on the span of \e, : i £ S,}. 
Then 

WQjPQjW = II QjPiQjPyW = WQjPf = IIQMI 2 - 

But, if v £ V, then, applying d8.2L 

\\Qjv\\ 2 = E \ ( v ' e i)\ 2 = Y2( A i v ' v ) = <( E A i) v ' v ) < II E A /ll ■ IMI 2 - 


z’GSv 


i £ S j 


ieS, 


iESj 


So 


\\QjPQ } \\ = \\Q\v\\ 2 <tLM< (^ + \/\\^p\\°°) / 

ieS 7 - VvT / 

and the lemma is proved. 


□ 


Theorem 8.2 (The paving conjecture). For any e > 0 ffere exists r £ N such that, for any m £ N 
and T £ M m (C) zczf/z diag T — 0 f/zere exist diagonal orthogonal projections Qi,..., Q r £ M m (C), with 
E/=i Qj = hn, such that 

WQ/TQjW < e\\T\\ 

for all j — 1,, r. 

Proof. Suppose first that T — T* and ||T|| < 1. The 2 hi x 2/zz matrix 

/ i+I l (Jm _ T 2 ) l/2\ 


P = 


Im — T 


is an orthogonal projection and diag P = Q, ■ • •' 2 ) ■ Choose r large enough to have 2 — 7 = + -E j ~ - 
1 < e. It follows from Lemma 18.11 that there exist diagonal projections Q ",..., Q” G M 2 m (C) with 

Ef=i Q" = hd and HQ-'PQfll < (77 + 75) 2 for all i = 1.r. 

Let Q" — Qi + Q'j be the decomposition of Q" in the diagonal projections corresponding to the first 
m and the last m vectors of the basis of C 2m . So E;=i Qi — E;= 1 Qi — hn and, for each i — 1 ,..., m, 

(8.4) ||Q,( I + r)Q,||<2(-L + -L) 2 , || Q ; ( ,-r)Q;i|<2(-L + -L) 2 . 

, 2 


The first ini 


inequality implies that Q;(f + T)Q ; < 2 Q;, 


so 


(8.5) 


- Qi < QiTQi < 


2 l 7 ? + 7 ?) - 1 


Qi < eQi 


(the left inequality being obvious). Similarly, the second inequality in (18.4b yields 


( 8 . 6 ) 


-eQ'< 


1-2 


j_ j_y 

7 = r + V2j 


Qi < Q'TQ' < Qj- 
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If we define Q,j — Q,Q' ( i,j — 1 , ... , r), then E^-=i Qij — hn, and it follows from (18.5b and (18.6b that 

-eQij < QijTQij < eQij, 

or QijTQij < e. The theorem is thus proved for T a selfadjoint contraction, and it is immediate to 
extend it to arbitrary selfadjoint matrices. 

If we take now an arbitrary T E M m (C), with diag T — 0, we may write it as T — A + iB, with A, B 
selfadjoint, ||A||, ||B|| < ||T||, and diag A = diagB — 0. Applying the first step, one finds diagonal 
projections Q ' v ..., Q', Q '{,..., Q" E M m (C), with E;=l Q'i = E/=l Q" = l m, ||Q'^Q-|] < f Ill’ll and 
II Q'/BQ'/W < § || T|| for i = 1 ,..., r. If we define Q f/ - Q'Q", then £[ j=1 Q hj - I nu and || Q i; -TQ f/ || < 
e||T|| fort,; = □ 

By writing carefully the estimates in the proof, one sees also that we may take r of order e 4 . 

9. Final Remarks 

1. As noted in Remark 12.61 there is a connection between the Kadison-Singer problem and quan¬ 
tum mechanics. We will give here a very perfunctory account. In the von Neumann picture of quan¬ 
tum mechanics, states (in the common sense) of a system correspond to states <p (in the C*-algebra 
sense) of B(TL), while observables of the system correspond to selfadjoint operators A £ B(TL). The 
value of an observable in a state is precisely <p(A). 

A maximal abelian C*-algebra A C B(Ti ) corresponds to a maximal set of mutually compatible 
observables. If the extension of any pure state on A to a state on B(TL) is unique, then one can say 
that the given set of observables determines completely all other observables. This seems to have 
been assumed by Dirac implicitely. 

Now, there are various maximal abelian subalgebras of B{TL), but the problem can essentially 
be reduced to two different basic types: continuous (that are essentially isomorphic to L°° acting as 
multiplication operators on L 2 ) and discrete (that are isomorphic to T> acting in t 2 ). The main topic of 
the original paper 0 is to prove that extension of pure states is not unique in general for continuous 
subalgebras. They suspected that the same thing happens for the discrete case, but could not prove 
it, and so posed it as an open problem. 

2. We have said in the introduction that there are many statements that had been shown to be 
equivalent to (KS), besides (PC) that we have used in an essential way. We have thus, among others: 

(1) Weaver's conjectures in discrepancy theory. The original proof in [10] goes actually through 
one of these; the shortcut using (PC) is due to Tao fT4| . 

(2) Feichtinger's conjecture in frame theory. 

(3) Bourgain-Tzafriri conjecture. 

All these conjectures have in fact different forms, weaker or stronger variants, etc—a detailed 
account may be found in [4]. It is worth noting that up to 2013 most specialists believed that they 
are not true, and that a counterexample will eventually be found. So it was a surprise when all these 
statements were simultaneously shown true by 1101 . 
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3. The method used in [10] is even stronger than described above. Actually, its first application 
was to a completely different problem in graph theory: the existence of certain infinite families of 
so-called Ramanujan graphs [9] (see also IllTI for an account). 

4. The most tedious proof in the above notes is that of Lemma [5.51 The original argument in |10] 
is more elegant, but uses another result of Borcea and Branden [3] that represents real stable poly¬ 
nomials in two variables as determinants of certain matrices—a kind of converse to Lemma [5721 The 
direct argument we use appears in [14]. 
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